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SEQUENCE LISTING 



<110> KIM, Young Tae 
LEE, Jae Hyung 

<120> Gene involved in the biosynthesis of carotenoid and marine 
microorganism, paracoccus haeundaesis, producing the 
carotenoid 

<130> 428.1056 

<150> PCT/KR2004/000752 
<151> 2003-03-31 

<150> ' KR2003-20222 
<151> 2003-03-31 

<150> KR2003-20023 
<151> 2003-03-31 

<160> 18 

<170> Kopatentln 1.71 

<210> 1 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer for Paracoccus haeundaesis 16S rDNA 



<400> 1 

cataagtaat tatggttttg t 
21 



<210> 2 

<211> ' 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> • reverse primer for Paracoccus haeundaesis 16S rDNA 

<400> 2 
cgcttcctta gaaaggag 
18 



<210> 3 
<211> 1454 
<212> DNA 



1 



<213> Paracoccus haeundaesis 
<400> 3 

caacttgaga gtttgatcct ggctcagaac gaacgctggc ggcaggctta acacatgcaa 
60 

gtcgagcgag accttcgggt ctagcggcgg acgggtgagt aacgcgtggg aacgtgccct 
120 

tctctacgga atagccccgg gaaactggga gtaataccgt atacgccctt tgggggaaag 
180 

atttatcgga gaaggatcgg cccgcgttgg attaggtagt tggtggggta atggcccacc 
240 

aagccgacga tccatagctg gtttgagagg atgatcagcc acactgggac tgagacacgg 
300 

cccagactcc tacgggaggc agcagtgggg aatcttagac aatgggggca accctgatct 
360 

agccatgccg cgtgagtgat gaaggcctta gggttgtaaa gctctttcag ctgggaagat 
420 

aatgacggta ccagcagaag aagccccggc taactccgtg ccagcagccg cggtaatacg 
480 

gagggggcta gcgttgttcg gaattactgg gcgtaaagcg cacgtgggcg gactggaaag 
540 

tcagaggtga aatcccaggg ctcaaccttg gaactgcctt tgaaactatc agtctggagt 
600 

tcgagagagg tgagtggaat tccgagtgta gaggtgaaat tcgtagatat tcggaggaac 
660 

accagtggcg aaggcggctc actggctcga tactgacgct gaggtgcgaa agcgtgggga 
720 

gcaaacagga ttagataccc tggtagtcca cgccgtaaac gatgaatgcc agacgtcggc 
780 

aagcatgctt gtcggtgtca cacctaacgg attaagcatt ccgcctgggg agtacggtcg 
840 

caagattaaa actcaaagga attgacgggg gcccgcacaa gcggtggagc atgtggttta 
900 

attcgaagca acgcgcagaa ccttaccaac ccttgacatg gcaggaccgc tggagagatt 
960 

cagctttctc gtaagagacc tgcacacagg tgctgcatgg ctgtcgtcag ctcgtgtcgt 
1020 

gagatgttcg gttaagtccg gcaacgagcg caacccacgt ccctagttgc cagcattcag 
1080 

ttgggcactc tatggaaact gccgatgata agtcggagga aggtgtggat gacgtcaagt 

2 



I- 

1140 

tctcatggcc cttacgggtt gggctacaca 
1200 

tccccaaaag ccatctcagt tcggattgtc 
1260 

cgctagtaat cgcggaacag catgccgcgg 
1320 

cccgtcacac catgggagtt ggttctaccc 
1380 

gcggccacgg taggatcagc gactggggtg 
1440 

tgcggctgga tcac 
1454~ 



<210> • 4 

<211> 6223 

<212> DNA 

<213> crt gene 

<400> 4 

gttccacgac tggggcatcc ccacgaccgc 
60 

gccggaccgg gttctggtcg ggtcgggcgg 
120 

catccgcctc ggcgcggacc tcgtggggca 
180 

cagcgccgag gccctgtccg atcacctgtc 
240 

gttctgcacc ggatcgggcg accttgcagc 
300 

gccgggtggc caatggtcgc aagcaacggg 
360 

tgcgcggatc gccggtccgg gggacaagat 
420 

gaccgccacc agcctgatcg tctcgggcgg 
480 

gcatgcgctg tggtttctgg acgcggcggc 
540 

ggggctgacc tggctgtcgg tcggtctgtt 
600 

ggtcgtgccg gggcgtccgc gcggcaatgc 



cgtgctacaa tggtggtgac agtgggttaa 
ctctgcaact cgagggcatg aagttggaat 
tgaatacgtt cccgggcctt gtacacaccg 
gacgacgctg cgctaacctt cggggggcag 
aagtcgtaac aaggtagccg taggggaacc 



gtcgctgcgc gccatcgcgc cgatgatggg 
ggtgcgtcac gggctggacg ccgcgcgggc 
ggcggcccgc gcgctgcccg ccgcgcgcca 
cgacgtcgtg acccagctgc gcatcgcgat 
gctgcgctgc gcgcctctgc tggtgccggg 
gatggaaacc ggcgatgcgg gactgtagtc 
gagcgcacat gccctgccca aggcagatct 
catcatcgcc gcgtggctgg ccctgcatgt 
gcatcccatc ctggcgatcg cgaatttcct 
cttcatcgcg catgacgcga tgcacgggtc 
ggcgatgggc cagctggtcc tgtggctgta 



♦ 

660 

tgccggattt tcgtggcgca agatgatcgt 
720 

aaccgacgac gaccccgatt tcgaccatgg 
780 

cggcacctat ttcggctggc gcgaggggct 
840 

gctgatcctg ggggatcgct ggatgtacgt 
900 

gtcgatccag ctgttcgtgt tcggcacctg 
960 

cccggaccgc cataatgcgc ggtcgtcgcg 
1020 

ctttcacttt ggtggttatc atcacgaaca 
1080 

cctgcccagc acccgcacca agggggacac 
1140 

accgtgctgg tgatggagtt gacggcctat 
1200 

ctgggctggg gctggcacaa gtcccaccac 
1260 

gacctgtacg gcctggtctt tgcggtgatc 
1320 

tgggcgccgg tcctgtggtg gatcgctttg 
1380 

gtcctgcatg acgggctggt tcatcagcgc 
1440 

tatgcccgcc gcctgtatca ggcccaccgc 
1500 

tgcgtcagct tcggcttcat ctatgcgccg 
1560 

acgtcgggcg tgctgcgggc cgaggcgcag 
1620 

caggggcggg ccttgcgaac gggctgatcg 
1680 

tgcgggtgct gctgctggat catgcggcgg 
1740 

acgaccccga tctgtcgccg cactggctgg 
1800 



caagcacatg gcccatcacc gccataccgg 
cggcccggtc cgctggtacg cgcgcttcat 
gctgctgccc gtcatcgtga cggtctatgc 
ggtcttctgg ccgctgccgt cgatcctggc 



gctgccgcac cgccccggcc acgacgcgtt 



gatcagcgac cccgtgtcgc tgctgacctg 
ccacctgcac ccgacggtgc cttggtggcg 
cgcatgacca atttcctgat cgtcgtcgcc 



tccgtccacc gttggatcat gcacggcccc 



gaggaacacg accacgcgct ggaaaagaac 
gccacggtgc tgttcacggt gggctggatc 
ggcatgaccg tctatgggct gatctatttc 



tggccgttcc gctatatccc gcgcaagggc 
ctgcaccacg cggtcgaggg acgcgaccat 



ccggtcgaca agctgaagca ggacctgaag 
gagcgcacgt gacccatgac gtgctgctgg 



ccctggcgct gcgcgcggcg cggcccgacc 
gaccgtcaga cggccatacc tggtcctgcc 
cgcggctgaa gcccctgcgc cgcgccaact 



ggcccgacca ggaggtgcgc 
1860 

cgctggacgg ggcggcgctg 

acagcgacat cgccctgctg 
1980 

aggcgggcgc ggtcctggac 
2040 

tccagaaatt cgtgggcgtc 
2100 

tgatcatgga cgcgaccgtc 
2160 

tctctccgac gcgcatcctg 
2220 

acgacgcgct ggcggcggcg 
2280 

aggtccggcg cgaacgcggc 
2340 

gggccgatca cgcggagggg 
2400 

tcaccggcta ttcgctgccc 
2460 

ggccgcccgg caccgacgcg 
2520 

gccgtgaccg ctttctgcgc 
2580 

ggcgctatac cctgctgcag 
2640 

atgccggccg gctgagcgtg 
2700 

cccttggcac ggccatccgc 
2760 

cgcccattcg cccgcggcca 
2820 

cctggc'catc cgcctgcagt 
2880 

gcccggcggg cgcgcctatg 
2940 

cgtcatcacc gaccccgatg 



tttccccgcc atgcccggcg 

gcggatgcgg tggcccggtc 
gatgaacagg gggcgacgct 

gggcgcggcg cgcagccgtc 

gagatcgaga ccgactgccc 

acccagcagg acgggtaccg 

atcgaggaca ctcgctattc 

tcccacgact atgcccgcca 

atcctgccca ttgcgctggc 

cctgttcccg tgggactgcg 

tatgcggcgc aggtggcgga 

ctgcgcggcg ccatccgcga 

ctgctgaacc ggatgctgtt 

cggttctacc gcatgccgca 

gcggatcagc tgcgcatcgt 

tgcctgcccg aacgtcccct 

agaccgccat cgtgatcggc 

ccgcgggcat cgccaccacc 

tctggcacga tcagggccat 

cgctcaagga gctgtgggcg 



gctggccacc ggttacgggt 

gggcgccgag atccgctgga 1 
gtcctgcggc acccggatcg 

gcggcatctg accgtgggtt 

ccacggcgtg ccccgcccga 

attcatctat ctgctgccct 

cgatggcggc aatctggacg 

gcagggctgg accggggccg 

ccatgacgcg gcgggcttct 

cgcggggttc tttcacccgg 

cgtggtggcg ggcctgtccg 

ttacgcgatc gaccgggcac 

ccgcggctgc gcgcccgacc 

tggactgatc gaacggttct 

gaccggcaag cctcccattc 

gctgaaggaa aacgcatgaa 

gcaggctttg gcgggctggc 

ctggtcgagg cccgggacaa 

gtcttcgacg cgggcccgac 

ctgaccgggc aggacatggc 



> 

3000 

gcgcgacgtg acgctgatgc cggtgtcgcc 
3060 

ggtcttcgat tacgtgaacg aggccgatca 
3120 

ggacgacctg gaaggatacc gccgcttccg 
3180 

ctacgtcaag ctgggcaccg tgcccttcct 
3240 

cgcgctgatg aagctggagg cctataagtc 
3300 • 

ggacccctat ctgcggcagg cgttttcgta 
3360 

ctcgaccagc tcgatctatg cgctgatcca 
3420 

cgccaagggc ggcaccaacc agctggtcgc 
3480 

cggccagatg atgctgaacg ccaaggtcgc 
3540 

gggcgtcacc ctggcggacg ggcggtcttt 
3600 

cgtcatgcac aactatcgcg acctgctggg 
3660 

atcgctggac cgcaagcgct ggtccatgtc 
3720 

ggcgcccaag gacatcgcgc atcacaccat 
3780 

caacgagatc ttcaagggcc cgaagctggc 
3840 

ctgcacgacc gatccggaca tggcgcctcc 
3900 

cgtgccgcat ctgggccgcg ccgagatcga 
3960 

ccgcatcctg gcgtccctgg aggagcggct 
4020 

gacgcgcatc ttcacgcccg ccgatttcgc 
4080 

cttctcggtc gagccgatcc tgacgcaatc 
4140 



cttctatcga ctgatgtggc cgggcgggaa 

gctggagcgc cagatcgccc agttcaaccc 

tgattacgcg gaggaggtgt atcaggaggg 

caagctgggc cagatgctca aggccgcgcc 

cgtccatgcc aaggtcgcga ccttcatcaa 

tcacacgctg ctggtgggcg ggaatccctt 

cgcgctggag cggcgcggcg gggtctggtt 

gggcatggtc gcgctgttcg aacggcttgg 

ccggatcgag accgagggcg cgcggaccac 

aagggccgac atggtcgcca gcaacggcga 

ccacacggcc cgcgggcaga gccgcgcgaa 

gttgttcgtg ctgcatttcg gtctgcgcga 

cctgttcggc ccccgctaca gggagctggt 

cgaggatttc tcgctgtacc tgcattcgcc 

gggcatgtcc acgcattacg tgctggcccc 

ttgggcggtc gaggggccgc gctatgccga 

gatcccgaac ctgcgcgcca acctgaccac 

cagcgaactg aacgcccatc acggcagcgc 

cgcgtggttc cggccgcaca accgcgacaa 



gacgatccgc aacttctatc tggtcggcgc 
4200 

cgtcgtgggc tcggccaagg ccacggccca 
4260 

agcgatctgg tcctgacctc gaccgaggcg 
4320 

gcggccaagc tgatgccgcc gggcatccgc 
4380 

cgccacgcgg atgacgtgat cgacggtcag 
4440 

gacccgcagg cgcggctgga cggcctgcgc 
4500 

ggtccggtga ccccgccctt tgccgcgctg 
4560 

caggcctggc ccatggacct gatcgaaggc 
4620 

cgcacgctgg atgacgtgct ggaatattcc 
4680 

atggcccgcg tgatgggcgt gcgcgacgat 
4740 

ctggcgttcc agctgaccaa catcgcgcgc 
4800 

tgctatctgc cgggggactg gctggaccag 
4860 

tcgccggagc tgtacacagt gatcctccgg 
4920 

tcggcgcggg tgggtctggc ggatctgcca 
4980 

ctacggatct atcgcgccat cgggctgcgc 
5040 

cagcggatca gcacgtccaa ggctgccaag 
5100 

gtcgcgcgat cacgcctgcc gggggcgggc 
5160 

catcacgtct aggcgcgcgc ggcgtagggc 
5220 

ggagcctgaa ggcgcttgct gcgcagcatc 
5280 



gggcacccat ccgggcgcgg gcattccggg 
ggtgatgctg tccgacctgg cgggcgcatg 
atcacccaag ggtcgcaaag ctttgccacg 
gacgacacgg tgatgctcta tgcctggtgc 
gccctgggca gccgccccga ggcggtgaac 
gtcgacacgc tggcggccct gcagggcgac 
cgcgcggtgg cgcggcggca tgatttcccg 
ttcgcgatgg atgtcgaggc gcgcgactat 
tatcacgtcg caggcatcgt cggcgtgatg 
cctgtcctgg accgcgcctg cgacctgggg 
gacgtgatcg acgatgcgcg catcgggcgg 
gcgggcgcgc ggatcgacgg gccggtgccg 
ctgttggatg aggcggaacc ctattacgcg 
ccgcgctgcg cctggtccat cgccgccgcg 
atccgcaaga gcgggccgca ggcctatcgc 
atcggcctgc tgggcgtcgg gggctgggat 
gtgtcgcggc agggcctctg gacccggccg 
agaacccgtt ccagcagggc cgcgatttcc 
gcgtccagtt gggcgcggct ggcctcgtaa 



7 



tgacgggaca cgttctgcag gtctgacacg 
5340 

gcggcatcgc gaccggtatc cttgccaagc 
5400 

tcgtcatagg actggaacac gcggcccagc 
5460 

tcctcggcgt cgaactcctt gatcacggcc 
5520 

ccggtcttca ggtcctgttc ctgttcgacc 
5580 

aggtcctggc cggcgcacag gccctgcggc 
5640 

tgcgcccgca ccgtgcccga cgcgccgcgc 
5700 

atcagggcga tgccgcccag cacggcacgg 
5760 

cggccgcggc gcagcccggc atcgtccatg 
5820 

gcatgcacca tctcgaccgc gcaggcggcg 
5880 

gcctctgccg caagcagcat cagcatgccg 
5940 

tggctcatgg ccgcgccgag cggctgcgac 
6000 

agtctggtct gcagaagggt ggcgtggatc 
6060 

gcgcttgggt tctgacctgg cgggaaggtc 
6120 

ccaccgtcaa cagtccccat gttggaacgg 
6180 

cgacgcgggg tcgcgcggca atttgtccaa 
6223 



<210> 5 

<211> 729 

<212> DNA 

<213> crtW gene 

<400> 5 

atgagcgcac atgccctgcc caaggcagat 
60 



gccagaaggc cgcgccgcgg gccgggggcc 
gccgcctggt cgcccacgac gtccagcagg 
tgacggccaa agtcgatcat ctgggtctgc 
agcatctcca gcccggcgat gaacagcacg 
cccgcgccgt tcttggccgc gtgcaggtcc 
cccagggacc gcgacaggat ccgcaccagc 
gcaccggcca gcagggccat tgcctcggtg 
ctttcgccat gcgccacatg ggtcgcgggc 
cagggcaggt cgtcgaagat cagcgatgcg 
tcgacgatcg tgtcgcagac cccgcccgag 
cggaaccgcc tgcccgacga cagcgcgcca 
acggcaccga atccctgggc gatctcctca 
gggttgacgt ctcgtctcat cagtgccttc 
aggccggggc ggcaccccgt gacccgtcat 
ttcacgcccg attgcgagcc ttttcgacgg 
caaggtcagt gga 



ctgaccgcca ccagcctgat cgtctcgggc 



8 



ggcatcatcg ccgcgtggct ggccctgcat gtgcatgcgc tgtggtttct ggacgcggcg 
120 

gcgcatccca tcctggcgat cgcgaatttc ctggggctga cctggctgtc ggtcggtctg 
180 

ttcttcatcg cgcatgacgc gatgcacggg tcggtcgtgc cggggcgtcc gcgcggcaat 
240 

gcggcgatgg gccagctggt cctgtggctg tatgccggat tttcgtggcg caagatgatc 
300 

gtcaagcaca tggcccatca ccgccatacc ggaaccgacg acgaccccga tttcgaccat 
360 

ggcggcccgg tccgctggta cgcgcgcttc atcggcacct atttcggctg gcgcgagggg 
420 

ctgctgctgc ccgtcatcgt gacggtctat gcgctgatcc tgggggatcg ctggatgtac 
480 

gtggtcttct ggccgctgcc gtcgatcctg gcgtcgatcc agctgttcgt gttcggcacc 
540 

tggctgccgc accgccccgg ccacgacgcg ttcccggacc gccataatgc gcggtcgtcg 
600 

cggatcagcg accccgtgtc gctgctgacc tgctttcact ttggtggtta tcatcacgaa 
660 

caccacctgc acccgacggt gccttggtgg cgcctgccca gcacccgcac caagggggac 
720 

accgcatga 
729 



<210> 6 
<211> 242 
<212> PRT 

<213> crtW amino acid 
<400> 6 

Met Ser Ala His Ala Leu Pro Lys Ala Asp Leu Thr Ala Thr Ser Leu 

15 10 15 

He Val Ser Gly Gly He He Ala Ala Trp Leu Ala Leu His Val His 

20 25 30 

Ala Leu Trp Phe Leu Asp Ala Ala Ala His Pro He Leu Ala He Ala 

35 40 45 

Asn Phe Leu Gly Leu Thr Trp Leu Ser Val Gly Leu Phe Phe lie Ala 
50 55 60 

His Asp Ala Met His Gly Ser Val Val Pro Gly Arg Pro Arg Gly Asn 

65 70 75 

80 



9 



Ala Ala Met Gly Gin Leu Val Leu Trp Leu Tyr Ala Gly Phe Ser Trp 

85 90 95 

Arg Lys Met He Val Lys His Met Ala His His Arg His Thr Gly Thr 

100 105 HO 

Asp Asp Asp Pro Asp Phe Asp His Gly Gly Pro Val Arg Trp Tyr Ala 
115 120 125 

Arg Phe He Gly Thr Tyr Phe Gly Trp Arg Glu Gly Leu Leu Leu Pro 
130 135 140 

Val He Val Thr Val Tyr Ala Leu He Leu Gly Asp Arg Trp Met Tyr 

145 150 155 

160 

Val Val Phe Trp Pro Leu Pro Ser lie Leu Ala Ser He Gin Leu Phe 

165 170 175 

Val Phe Gly Thr Trp Leu Pro His Arg Pro Gly His Asp Ala Phe Pro 

180 185 190 

Asp Arg His Asn Ala Arg Ser Ser Arg He Ser Asp Pro Val Ser Leu 
195 200 205 

Leu Thr Cys Phe His Phe Gly Gly Tyr His His Glu His His Leu His 
210 215 220 

Pro Thr Val Pro Trp Trp Arg Leu Pro Ser Thr Arg Thr Lys Gly Asp 

225 230 235 

240 

Thr Ala 



<210> 7 

<211> 489 

<212> DNA 

<213> crtZ gene 

<400> 7 

atgaccaatt tcctgatcgt cgtcgccacc gtgctggtga tggagttgac ggcctattcc 
60 

gtccaccgtt ggatcatgca cggccccctg ggctggggct ggcacaagtc ccaccacgag 
120 

gaacacgacc acgcgctgga aaagaacgac ctgtacggcc tggtctttgc ggtgatcgcc 
180 

acggtgctgt tcacggtggg ctggatctgg gcgccggtcc tgtggtggat cgctttgggc 
240 

atgaccgtct atgggctgat ctatttcgtc ctgcatgacg ggctggttca tcagcgctgg 
300 



10 



ccgttccgct atatcccgcg caagggctat gcccgccgcc tgtatcaggc ccaccgcctg 
360 

caccacgcgg tcgagggacg cgaccattgc gtcagcttcg gcttcatcta tgcgccgccg 
420 

gtcgacaagc tgaagcagga cctgaagacg tcgggcgtgc tgcgggccga ggcgcaggag 
480 

cgcacgtga 
489 



<210> 8 
<211> ' 162 
<212> PRT 

<213> crtZ amino acid 
<400> 8 

Met Thr Asn Phe Leu lie Val Val Ala Thr Val Leu Val Met Glu Leu 

15 10 15 

Thr Ala Tyr Ser Val His Arg Trp lie Met His Gly Pro Leu Gly Trp 

20 25 30 

Gly Trp His Lys Ser His His Glu Glu His Asp His Ala Leu Glu Lys 

35 40 45 

Asn Asp Leu Tyr Gly Leu Val Phe Ala Val He Ala Thr Val Leu Phe 
50 55 60 

Thr Val Gly Trp He Trp Ala Pro Val Leu Trp Trp He Ala Leu Gly 

65 70 75 

80 

Met Thr Val Tyr Gly Leu He Tyr Phe Val Leu His Asp Gly Leu Val 

85 90 95 

His Gin Arg Trp Pro Phe Arg Tyr He Pro Arg Lys Gly Tyr Ala Arg 

100 105 110 

Arg Leu Tyr Gin Ala His Arg Leu His His Ala Val Glu Gly Arg Asp 
115 120 125 

His Cys Val Ser Phe Gly Phe He Tyr Ala Pro Pro Val Asp Lys Leu 
130 135 140 

Lys Gin Asp Leu Lys Thr Ser Gly Val Leu Arg Ala Glu Ala Gin Glu 

145 150 155 

160 

Arg Thr 



<210> 9 
<211> 1161 
<212> DNA 



11 



<213> crtY gene 
<400> 9 

gtgacccatg acgtgctgct ggcaggggcg 
60 

ctgcgcgcgg cgcggcccga cctgcgggtg 
120 

gacggccata cctggtcctg ccacgacccc 
180 

aagcccctgc gccgcgccaa ctggcccgac 
240 

cggctggcca ccggttacgg gtcgctggac 
300 

tcgggcgccg agatccgctg gaacagcgac 
360 

ctgtcctgcg gcacccggat cgaggcgggc 
420 

tcgcggcatc tgaccgtggg tttccagaaa 
480 

ccccacggcg tgccccgccc gatgatcatg 
540 

cgattcatct atctgctgcc cttctctccg 
600 

tccgatggcg gcaatctgga cgacgacgcg 
660 

cagcagggct ggaccggggc cgaggtccgg 
720 

gcccatgacg cggcgggctt ctgggccgat 
780 

cgcgcggggt tctttcaccc ggtcaccggc 
840 

gacgtggtgg cgggcctgtc cgggccgccc 
900 

gattacgcga tcgaccgggc acgccgtgac 
960 

ttccgcggct gcgcgcccga ccggcgctat 
1020 

catggactga tcgaacggtt ctatgccggc 
1080 

gtgaccggca agcctcccat tccccttggc 



ggccttgcga acgggctgat cgccctggcg 
ctgctgctgg atcatgcggc gggaccgtca 
gatctgtcgc cgcactggct ggcgcggctg 
caggaggtgc gctttccccg ccatgcccgg 



ggggcggcgc tggcggatgc ggtggcccgg 
atcgccctgc tggatgaaca gggggcgacg 



gcggtcctgg acgggcgcgg cgcgcagccg 
ttcgtgggcg tcgagatcga gaccgactgc 
gacgcgaccg tcacccagca ggacgggtac 
acgcgcatcc tgatcgagga cactcgctat 
ctggcggcgg cgtcccacga ctatgcccgc 
cgcgaacgcg gcatcctgcc cattgcgctg 
cacgcggagg ggcctgttcc cgtgggactg 
tattcgctgc cctatgcggc gcaggtggcg 
ggcaccgacg cgctgcgcgg cgccatccgc 
cgctttctgc gcctgctgaa ccggatgctg 
accctgctgc agcggttcta ccgcatgccg 
cggctgagcg tggcggatca gctgcgcatc 
acggccatcc gctgcctgcc cgaacgtccc 



1140 



ctgctgaagg aaaacgcatg a 
1161 



<210> 10 
<211> 386 
<212> PRT 

<213> crtY amino acid 
<400> 10 

Val Thr His Asp Val Leu Leu Ala Gly Ala Gly Leu Ala Asn Gly Leu 

15 10 15 

lie Ala Leu Ala Leu Arg Ala Ala Arg Pro Asp Leu Arg Val Leu Leu 

20 25 30 

Leu Asp His Ala Ala Gly Pro Ser Asp Gly His Thr Trp Ser Cys His 

35 40 45 

Asp Pro Asp Leu Ser Pro His Trp Leu Ala Arg Leu Lys Pro Leu Arg 
50 55 60 

Arg Ala Asn Trp Pro Asp Gin Glu Val Arg Phe Pro Arg His Ala Arg 

65 70 75 

80 

Arg Leu Ala Thr Gly Tyr Gly Ser Leu Asp Gly Ala Ala Leu Ala Asp 

85 90 95 

Ala Val Ala Arg Ser Gly Ala Glu lie Arg Trp Asn Ser Asp lie Ala 

100 105 110 

Leu Leu Asp Glu Gin Gly Ala Thr Leu Ser Cys Gly Thr Arg He Glu 
115 120 125 

Ala Gly Ala Val Leu Asp Gly Arg Gly Ala Gin Pro Ser Arg His Leu 
130 135 140 

Thr Val Gly Phe Gin Lys Phe Val Gly Val Glu He Glu Thr Asp Cys 

145 150 155 

160 

Pro His Gly Val Pro Arg Pro Met He Met Asp Ala Thr Val Thr Gin 

165 170 175 

Gin Asp Gly Tyr Arg Phe He Tyr Leu Leu Pro Phe Ser Pro Thr Arg 

180 185 190 

He Leu lie Glu Asp Thr Arg Tyr Ser Asp Gly Gly Asn Leu Asp Asp 
195 200 205 

Asp Ala Leu Ala Ala Ala Ser His Asp Tyr Ala Arg Gin Gin Gly Trp 
210 215 220 

Thr Gly Ala Glu Val Arg Arg Glu Arg Gly He Leu Pro He Ala Leu 
225 230 235 



13 



240 

Ala His Asp Ala Ala Gly Phe Trp Ala Asp His Ala Glu Gly Pro Val 

245 250 255 

Pro Val Gly Leu Arg Ala Gly Phe Phe His Pro Val Thr Gly Tyr Ser 

260 265 270 

Leu Pro Tyr Ala Ala Gin Val Ala Asp Val Val Ala Gly Leu Ser Gly 

275 280 285 

Pro Pro Gly Thr Asp Ala Leu Arg Gly Ala lie Arg Asp Tyr Ala lie 
290 295 300 

Asp Arg Ala Arg Arg Asp Arg Phe Leu Arg Leu Leu Asn Arg Met Leu 

305 310 315 

320 

Phe Arg Gly Cys Ala Pro Asp Arg Arg Tyr Thr Leu Leu Gin Arg Phe 

325 330 335 

Tyr Arg Met Pro His Gly Leu lie Glu Arg Phe Tyr Ala Gly Arg Leu 

340 345 350 

Ser Val Ala Asp Gin Leu Arg lie Val Thr Gly Lys Pro Pro lie Pro 
355 360 365 

Leu Gly Thr Ala lie Arg Cys Leu Pro Glu Arg Pro Leu Leu Lys Glu 
370 375 380 

Asn Ala 
385 



<210> 11 

<211> 1506 

<212> DNA 

<213> crtl gene 

<400> 11 

atgaacgccc attcgcccgc ggccaagacc gccatcgtga tcggcgcagg ctttggcggg 
60 

ctggccctgg ccatccgcct gcagtccgcg ggcatcgcca ccaccctggt cgaggcccgg 
120 

gacaagcccg gcgggcgcgc ctatgtctgg cacgatcagg gccatgtctt cgacgcgggc 
180 

ccgaccgtca tcaccgaccc cgatgcgctc aaggagctgt gggcgctgac cgggcaggac 
240 

atggcgcgcg acgtgacgct gatgccggtg tcgcccttct atcgactgat gtggccgggc 
300 

gggaaggtct tcgattacgt gaacgaggcc gatcagctgg agcgccagat cgcccagttc 
3 60 



14 



aacccggacg acctggaagg ataccgccgc 
420 

gagggctacg tcaagctggg caccgtgccc 
480 

gcgcccgcgc tgatgaagct ggaggcctat 
540 

atcaaggacc cctatctgcg gcaggcgttt 
600 

cccttctcga ccagctcgat ctatgcgctg 
660 

tggttcgcca agggcggcac caaccagctg 
720 

cttggcggcc agatgatgct gaacgccaag 
780 

accacgggcg tcaccctggc ggacgggcgg 
840 

ggcgacgtca tgcacaacta tcgcgacctg 
900 

gcgaaatcgc tggaccgcaa gcgctggtcc 
960 

cgcgaggcgc ccaaggacat cgcgcatcac 
1020 

ctggtcaacg agatcttcaa gggcccgaag 
1080 

tcgccctgca cgaccgatcc ggacatggcg 
1140 

gcccccgtgc cgcatctggg ccgcgccgag 
1200 

gccgaccgca tcctggcgtc cctggaggag 
1260 

accacgacgc gcatcttcac gcccgccgat 
1320 

agcgccttct cggtcgagcc gatcctgacg 
1380 

gacaagacga tccgcaactt ctatctggtc 
1440 

ccgggcgtcg tgggctcggc caaggccacg 
1500 

gcatga 



ttccgtgatt acgcggagga ggtgtatcag 

ttcctcaagc tgggccagat gctcaaggcc 

aagtccgtcc atgccaaggt cgcgaccttc 

tcgtatcaca cgctgctggt gggcgggaat 

atccacgcgc tggagcggcg cggcggggtc 

gtcgcgggca tggtcgcgct gttcgaacgg 

gtcgcccgga tcgagaccga gggcgcgcgg 

tctttaaggg ccgacatggt cgccagcaac 

ctgggccaca cggcccgcgg gcagagccgc 

atgtcgttgt tcgtgctgca tttcggtctg 

accatcctgt tcggcccccg ctacagggag 

ctggccgagg atttctcgct gtacctgcat 

cctccgggca tgtccacgca ttacgtgctg 

atcgattggg cggtcgaggg gccgcgctat 

cggctgatcc cgaacctgcg cgccaacctg 

ttcgccagcg aactgaacgc ccatcacggc 

caatccgcgt ggttccggcc gcacaaccgc 

ggcgcgggca cccatccggg cgcgggcatt 

gcccaggtga tgctgtccga cctggcgggc 
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<210> 12 
<211> 501 
<212> PRT 

<213> crtl amino acid 
<400> 12 

Met Asn Ala His Ser Pro Ala Ala Lys Thr Ala He Val He Gly Ala 

15 10 15 

Gly Phe Gly Gly Leu Ala Leu Ala He Arg Leu Gin Ser Ala Gly He 

20 25 30 

Ala Thr Thr Leu Val Glu Ala Arg Asp Lys Pro Gly Gly Arg Ala Tyr 

35 40 45 

Val Trp His Asp Gin Gly His Val Phe Asp Ala Gly Pro Thr Val He 
50 55 60 

Thr Asp Pro Asp Ala Leu Lys Glu Leu Trp Ala Leu Thr Gly Gin Asp 

65 70 75 

80 

Met Ala Arg Asp Val Thr Leu Met Pro Val Ser Pro Phe Tyr Arg Leu 

85 90 95 

Met Trp Pro Gly Gly Lys Val Phe Asp Tyr Val Asn Glu Ala Asp Gin 

100 105 110 

Leu Glu Arg Gin He Ala Gin Phe Asn Pro Asp Asp Leu Glu Gly Tyr 
115 120 125 

Arg Arg Phe Arg Asp Tyr Ala Glu Glu Val Tyr Gin Glu Gly Tyr Val 
130 135 140 

Lys Leu Gly Thr Val Pro Phe Leu Lys Leu Gly Gin Met Leu Lys Ala 

145 150 155 

160 

Ala Pro Ala Leu Met Lys Leu Glu Ala Tyr Lys Ser Val His Ala Lys 

165 170 175 

Val Ala Thr Phe He Lys Asp Pro Tyr Leu Arg Gin Ala Phe Ser Tyr 

180 185 190 

His Thr Leu Leu Val Gly Gly Asn Pro Phe Ser Thr Ser Ser He Tyr 
195 200 205 

Ala Leu He His Ala Leu Glu Arg Arg Gly Gly Val Trp Phe Ala Lys 
210 215 220 

Gly Gly Thr Asn Gin Leu Val Ala Gly Met Val Ala Leu Phe Glu Arg 

225 230 235 

240 



Leu Gly Gly Gin Met Met Leu Asn Ala Lys Val Ala Arg He Glu Thr 

16 



245 250 255 

Glu Gly Ala Arg Thr Thr Gly Val Thr Leu Ala Asp Gly Arg Ser Leu 

260 265 270 

Arg Ala Asp Met Val Ala Ser Asn Gly Asp Val Met His Asn Tyr Arg 

275 280 285 

Asp Leu Leu Gly His Thr Ala Arg Gly Gin Ser Arg Ala Lys Ser Leu 
290 295 300 

Asp Arg Lys Arg Trp Ser Met Ser Leu Phe Val Leu His Phe Gly Leu 

305 310 315 

320 

Arg Glu Ala Pro Lys Asp He Ala His His Thr He Leu Phe Gly Pro 

325 330 335 

Arg Tyr Arg Glu Leu Val Asn Glu He Phe Lys Gly Pro Lys Leu Ala 

340 345 350 

Glu Asp Phe Ser Leu Tyr Leu His Ser Pro Cys Thr Thr Asp Pro Asp 
355 360 365 

Met Ala Pro Pro Gly Met Ser Thr His Tyr Val Leu Ala Pro Val Pro 
370 375 380 

His Leu Gly Arg Ala Glu He Asp Trp Ala Val Glu Gly Pro Arg Tyr 

385 390 395 

400 

Ala Asp Arg He Leu Ala Ser Leu Glu Glu Arg Leu He Pro Asn Leu 

405 410 415 

Arg Ala Asn Leu Thr Thr Thr Arg He Phe Thr Pro Ala Asp Phe Ala 

420 425 430 

Ser Glu Leu Asn Ala His His Gly Ser Ala Phe Ser Val Glu Pro lie 

435 440 445 

Leu Thr Gin Ser Ala Trp Phe Arg Pro His Asn Arg Asp Lys Thr lie 
450 455 460 

Arg Asn Phe Tyr Leu Val Gly Ala Gly Thr His Pro Gly Ala Gly lie 

465 470 475 

480 

Pro Gly Val Val Gly Ser Ala Lys Ala Thr Ala Gin Val Met Leu Ser 

485 490 495 

Asp Leu Ala Gly Ala 

500 



<210> 13 

<211> 915 

<212> DNA 

<213> crtB gene 



17 



» 

t 



<400> 13 

atgagcgatc tggtcctgac ctcgaccgag gcgatcaccc aagggtcgca aagctttgcc 

60 

acggcggcca agctgatgcc gccgggcatc cgcgacgaca cggtgatgct ctatgcctgg 
120 

tgccgccacg cggatgacgt gatcgacggt caggccctgg gcagccgccc cgaggcggtg 
180 

aacgacccgc aggcgcggct ggacggcctg cgcgtcgaca cgctggcggc cctgcagggc 
240 

gacggtccgg tgaccccgcc ctttgccgcg ctgcgcgcgg tggcgcggcg gcatgatttc 
300 

ccgcaggcct ggcccatgga cctgatcgaa ggcttcgcga tggatgtcga ggcgcgcgac 
360 

tatcgcacgc tggatgacgt gctggaatat tcctatcacg tcgcaggcat cgtcggcgtg 
420 

atgatggccc gcgtgatggg cgtgcgcgac gatcctgtcc tggaccgcgc ctgcgacctg 
480 

gggctggcgt tccagctgac caacatcgcg cgcgacgtga tcgacgatgc gcgcatcggg 
540 

cggtgctatc tgccggggga ctggctggac caggcgggcg cgcggatcga cgggccggtg 
600 

ccgtcgccgg agctgtacac agtgatcctc cggctgttgg atgaggcgga accctattac 
660 

gcgtcggcgc gggtgggtct ggcggatctg ccaccgcgct gcgcctggtc catcgccgcc 
720 

gcgctacgga tctatcgcgc catcgggctg cgcatccgca agagcgggcc gcaggcctat 
780 

cgccagcgga tcagcacgtc caaggctgcc aagatcggcc tgctgggcgt cgggggctgg 
840 

gatgtcgcgc gatcacgcct gccgggggcg ggcgtgtcgc ggcagggcct ctggacccgg 
900 

ccgcatcacg tctag 
915 



<210> 14 
<211> 304 
<212> PRT 

<213> crtB amino acid 
<400> 14 

Met Ser Asp Leu Val Leu Thr Ser Thr Glu Ala He Thr Gin Gly Ser 
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1 5 10 

Gin Ser Phe Ala Thr Ala Ala Lys Leu Met Pro Pro Gly He Arg Asp 

20 25 30 

Asp Thr Val Met Leu Tyr Ala Trp Cys Arg His Ala Asp Asp Val He 

35 40 45 

Asp Gly Gin Ala Leu Gly Ser Arg Pro Glu Ala Val Asn Asp Pro Gin 
50 55 60 

Ala Arg Leu Asp Gly Leu Arg Val Asp Thr Leu Ala Ala Leu Gin Gly 

65 70 75 

80 

Asp Gly Pro Val Thr Pro Pro Phe Ala Ala Leu Arg Ala Val Ala Arg 

85 90 

Arg His Asp Phe Pro Gin Ala Trp Pro Met Asp Leu He Glu Gly Phe 

100 105 HO 

Ala Met Asp Val Glu Ala Arg Asp Tyr Arg Thr Leu Asp Asp Val Leu 
115 120 125 

Glu Tyr Ser Tyr His Val Ala Gly He Val Gly Val Met Met Ala Arg 
130 135 140 

Val Met Gly Val Arg Asp Asp Pro Val Leu Asp Arg Ala Cys Asp Leu 

145 150 155 

160 

Gly Leu Ala Phe Gin Leu Thr Asn lie Ala Arg Asp Val He Asp Asp 

165 170 

Ala Arg He Gly Arg Cys Tyr Leu Pro Gly Asp Trp Leu Asp Gin Ala 

180 185 190 

Gly Ala Arg He Asp Gly Pro Val Pro Ser Pro Glu Leu Tyr Thr Val 
195 200 205 

He Leu Arg Leu Leu Asp Glu Ala Glu Pro Tyr Tyr Ala Ser Ala Arg 
210 215 220 

Val Gly Leu Ala Asp Leu Pro Pro Arg Cys Ala Trp Ser He Ala Ala 

225 230 235 

240 

Ala Leu Arg He Tyr Arg Ala lie Gly Leu Arg lie Arg Lys Ser Gly 

245 250 

Pro Gin Ala Tyr Arg Gin Arg He Ser Thr Ser Lys Ala Ala Lys lie 

260 265 270 

Gly Leu Leu Gly Val Gly Gly Trp Asp Val Ala Arg Ser Arg Leu Pro 

275 280 285 



Gly Ala Gly Val Ser Arg Gin Gly Leu Trp Thr Arg Pro His His Val 
290 295 300 

« 
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<210> 15 

<211> 882 

<212> DNA 

<213> crtE gene 

<400> 15 

atgagacgag acgtcaaccc gatccacgcc 
60 

gcccagggat tcggtgccgt gtcgcagccg 
120 

tcgtcgggca ggcggttccg cggcatgctg 
180 

gtctgcgaca cgatcgtcga cgccgcctgc 
240 

atcttcgacg acctgccctg catggacgat 
300 

catgtggcgc atggcgaaag ccgtgccgtg 
360 

atggccctgc tggccggtgc gcgcggcgcg 
420 

atcctgtcgc ggtccctggg gccgcagggc 
480 

gcggccaaga acggcgcggg ggtcgaacag 
540 

atcgccgggc tggagatgct ggccgtgatc 
600 

atgatcgact ttggccgtca gctgggccgc 
660 

gtcgtgggcg accaggcggc gcttggcaag 
720 

ccgcggcgcg gccttctggc cgtgtcagac 
780 

agccgcgccc aactggacgc gatgctgcgc 
840 

gccctgctgg aacgggttct gccctacgcc 
882 



<210> 16 
<211> 293 



acccttctgc agaccagact tgaggagatc 



ctcggcgcgg ccatgagcca tggcgcgctg 



atgctgcttg cggcagaggc ctcgggcggg 
gcggtcgaga tggtgcatgc cgcatcgctg 
gccgggctgc gccgcggccg gcccgcgacc 



ctgggcggca tcgccctgat caccgaggca 
tcgggcacgg tgcgggcgca gctggtgcgg 



ctgtgcgccg gccaggacct ggacctgcac 



gaacaggacc tgaagaccgg cgtgctgttc 



aaggagttcg acgccgagga gcagacccag 
gtgttccagt cctatgacga cctgctggac 



gataccggtc gcgatgccgc ggcccccggc 



ctgcagaacg tgtcccgtca ttacgaggcc 



agcaagcgcc ttcaggctcc ggaaatcgcg 



gcgcgcgcct ag 
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<212> PRT 

<213> crtE amino acid 
<400> 16 

Met Arg Arg Asp Val Asn Pro lie His Ala Thr Leu Leu Gin Thr Arg 

15 10 15 

Leu Glu Glu He Ala Gin Gly Phe Gly Ala Val Ser Gin Pro Leu Gly 

20 25 30 

Ala Ala Met Ser His Gly Ala Leu Ser Ser Gly Arg Arg Phe Arg Gly 

35 40 45 

Met Leu Met Leu Leu Ala Ala Glu Ala Ser Gly Gly Val Cys Asp Thr 
50 55 60 

He Val Asp Ala Ala Cys Ala Val Glu Met Val His Ala Ala Ser Leu 

65 70 75 

80 

He Phe Asp Asp Leu Pro Cys Met Asp Asp Ala Gly Leu Arg Arg Gly 

85 90 95 

Arg Pro Ala Thr His Val Ala His Gly Glu Ser Arg Ala Val Leu Gly 

100 105 110 

Gly He Ala Leu He Thr Glu Ala Met Ala Leu Leu Ala Gly Ala Arg 
115 120 125 

Gly Ala Ser Gly Thr Val Arg Ala Gin Leu Val Arg lie Leu Ser Arg 
130 135 140 

Ser Leu Gly Pro Gin Gly Leu Cys Ala Gly Gin Asp Leu Asp Leu His 

145 150 155 

160 

Ala Ala Lys Asn Gly Ala Gly Val Glu Gin Glu Gin Asp Leu Lys Thr 

165 170 175 

Gly Val Leu Phe He Ala Gly Leu Glu Met Leu Ala Val He Lys Glu 

180 185 190 

Phe Asp Ala Glu Glu Gin Thr Gin Met He Asp Phe Gly Arg Gin Leu 
195 200 205 

Gly Arg Val Phe Gin Ser Tyr Asp Asp Leu Leu Asp Val Val Gly Asp 
210 215 220 

Gin Ala Ala Leu Gly Lys Asp Thr Gly Arg Asp Ala Ala Ala Pro Gly 

225 230 235 

240 

Pro Arg Arg Gly Leu Leu Ala Val Ser Asp Leu Gin Asn Val Ser Arg 

245 250 255 

His Tyr Glu Ala Ser Arg Ala Gin Leu Asp Ala Met Leu Arg Ser Lys 

260 265 270 
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Arg Leu Gin Ala Pro Glu lie Ala Ala Leu Leu Glu Arg Val Leu Pro 

275 280 285 

Tyr Ala Ala Arg Ala 
290 



<210> 17 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer for crt gene 



<400> 17 
gttccacgac tggggcatc 
19 



<210> 18 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ' reverse primer for crt gene 

<400> 18 

tccactgacc ttgttggaca aattgccg 
28 
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